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WHAT IS CLAIMED IS: 

1. A method of mapping a set of input patterns to an m- 
dimensional space, comprising the steps of: 

(a) selecting k patterns from said set of input patterns to form a 
subset of pattems{pi, i = 1, k}; 

(b) determining at least some pairwise relationships between at 
least some of the patterns in said subset of patterns {pi}; 

(c) mapping the patterns {pi} into a set of images in an m- 
dimensional space (pi ^ yi, i = 1, 2, ... k, yi G R") so that at 
least some of the pairwise distances between at least some of 
the images {yi} are representative of the relationships of the 
respective patterns {pi}; 

(d) determining a set of n attributes for each pattern in said subset 
of patterns {pi}, {Xj, i = 1, 2, ... k, Xj e R"}; 

(e) forming a training set T = { (Xi, yO, i = 1 , 2, ... k} ; 

(f) using a supervised machine learning technique to determine a 
mapping function based on the training set T; and 

(g) using said mapping function determined in step (f) to map 
additional patterns. 

2. The method of claim 1, wherein said mapping function is 
encoded in at least one neural network. 

3. The method of claim 1, wherein step (f) further comprises the 
steps of: 

(i) determining c n-dimensional reference points, {Ci, i = 1, 2, ... c, 
CiG R"}; 

(ii) partitioning T into c disjoint clusters Q based on a distance 
function d, {Cj = {(xi, yi): d(Xi, Cj).< <i(xi, Ck) for all k j; j = 1, 
2, ... c;i = l,2, ... ^};and 
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(iii) using a supervised machine learning technique to determine c 
independent mapping functions {f^, i = 1, 2, ... c}, based on the 
respective subsets Q of the training set T. 

4. The method of claim 3, wherein each said mapping function is 
encoded in at least one neural network. 



5. The method of claim 3, wherein step (g) further comprises the 
steps of: 

(i) for an additional input pattern p, determining a set of n 
attributes x, x e R"; 

(ii) determining the distance of x from each said reference point 
{ci}; 

(iii) identifying the reference point cj closest to x; and 

(iv) mapping x y, y G R"", using the mapping function 
associated with the reference point cj identified in step (iii). 

6. The method of claim 5, wherein each said mapping function is 
encoded in at least one neural network. 



7. The method of claim 3, wherein step (i) is performed using a 
clustering algorithm. 



8. The method of claim 1, wherein step (f) further comprises the 
steps of: 

(i) determining c m-dimensional reference points, {Ci, i = 1, 2, ... 
c, CiG R"}; 

(ii) partitioning T into c disjoint clusters C, based on a distance 
function d, {Cj = {(xj, yi): d(yi, Cj) < d(yi, Ck) for all k ^ j; j = 1, 
2, ... c;i = l,2, ... k}}; 
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(iii) using a supervised machine learning technique to determine c 
independent local mapping functions {f\^, i = 1, 2, ... c) based 
on the respective subsets Q of the training set T; and 

(iv) using a supervised machine learning technique to determine a 
global mapping function/^, based on the entire training set T. 



9. The method of claim 8, wherein each said mapping function is 
encoded in at least one neural network. 



10. The method of claim 8, wherein step (g) further comprises the 
steps of: 

(i) for an additional input pattern p, determining a set of n 
attributes x, x e R°; 

(ii) mapping x — » y', y' G R"", using the global mapping function 

(iii) determining the distance of y' to each reference point in {Ci}; 

(iv) identifying the reference point cj closest to y ' ; and 

(v) mapping x — > y, y G R™, using the local mapping function f^^ 
associated with the reference point cj identified in step (iv). 



11. The method of claim 10, wherein each said mapping function 
is encoded in at least one neural network. 



12. The method of claim 8, wherein step (i) is performed using a 
clustering algorithm. 



13. The method of claim 1, wherein step (c) further comprises the 
steps of: 

(i) selecting a subset of patterns from { Pi } ; 

(ii) revising the positions of the images of said selected subset of 
patterns in the m-dimensional space based on the relationships 
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between said selected subset of patterns determined in step (b); 
and 

(iii) repeating steps (i) and (ii) for additional subsets of patterns 
from {pi}. 

14. The method of claim 1, wherein step (d) comprises the step of: 
(i) determining a set of n attributes for each pattern in said subset 
of patterns {pi}, {Xi, i = 1, 2, ... k, Xj e R"}, wherein said 
attributes represent the relationships of each pattern in said 
subset of patterns with respect to n other reference patterns. 

15. The method of claim 1, wherein step (b) comprises the step of 
receiving pairwise relationship data from a subject. 

16. The method of claim 15, wherein step (b) comprises the steps of: 

(1) randomly selecting two patterns from the plurality of 

patterns; 

(2) presenting the two patterns to at least one subject; and 

(3) receiving pairwise relationship data from said subject 
about the patterns. 

17. The method of claim 16, further comprising the step of: 

(4) repeating steps (b)(1) through (b)(3) for additional pairs 

of patterns. 

18. The method of claim 17, wherein step (b) further comprises the step of 
receiving data about the subjects providing the pairwise relationship data. 



19. The method of claim 1, wherein step (b) comprises the steps of: 

(1) receiving pairwise relationship data via a 
communications path coupled to a computer; and 
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(2) storing the received pairwise relationship data in a 

memory. 

20. The method of claim 1, wherein step (b) comprises the steps of: 

(1) selecting a pair of patterns for similarity comparison; 

(2) transmitting information about the selected pair of 
patterns to a remote computer system; and 

(3) receiving pairwise relationship data about the selected 
pair of patterns from the remote computer system. 

21. The method of claim 20, further comprising the step of: 

(4) repeating steps (b)(1) through (b)(3) for additional pairs of 
compounds. 

22. The method of claim 21, wherein step (b)(1) comprises the step of 
selecting the pair of patterns at random. 

23. A computer program product comprising a computer useable medium 
having computer program logic recorded thereon for enabling a processor to 
obtain pairwise relationship data about the selected pair of patterns, said 
computer program logic comprising: 

a selecting procedure that enables the processor to select a plurality of 
patterns from a database for similarity; 

a transmitting procedure that enables a processor to transmit selected 
patterns to a remote computer for similarity comparison; and 

a receiving procedure that enables the processor to receiving similarity 
data about patterns transmitted to the remote computer. 

24. The computer program product of claim 23, further comprising: 

a storing procedure that enables the processor to store received 
similarity data for subsequent retrieval. 
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25. The computer program product of claim 24, wherein said selecting 
procedure randomly selecting patterns from the database. 

26. A system for mapping a set of input patterns to an m-dimensional 
space, comprising: 

means for selecting k patterns from said set of input patterns to form a 
subset of pattems{pi, i = 1, k}; 

means for determining at least some pairwise relationships between at 
least some of the patterns in said subset of patterns {pj}; 

means for mapping the patterns {pi} into a set of images in an m- 
dimensional space (p; yj, i = 1, 2, ... k, yi e R"") so that at least some of the 
pairwise distances between at least some of the images {yi} are representative 
of the relationships of the respective patterns {pi}; 

means for determining a set of n attributes for each pattern in said 
subset of patterns {pi}, {xi, i = 1, 2, ... k, Xi e R°}; 

means for forming a training set T= {(Xi, yi), i = 1, 2, ... k}; 

means for using a supervised machine learning technique to determine 
a mapping function based on the training set T; and 

means for using said mapping function determined in step (f) to map 
additional patterns. 

27. A computer program product comprising a computer useable medium 
having computer program logic recorded thereon for enabling a processor to 
obtain map a set of input patters to an m-dimensional space, said computer 
program logic comprising: 

a procedure that selects k patterns from said set of input patterns to 
form a subset of pattemsjpi, i = 1, . . ., k}; 

a procedure that determines at least some pairwise relationships 
between at least some of the patterns in said subset of patterns {pi}; 
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a procedure that maps the patterns {pi} into a set of images in an m- 
dimensional space (pi yj, i = 1, 2, ... k, yj e R"") so that at least some of the 
pairwise distances between at least some of the images {yj} are representative 
of the relationships of the respective patterns {pi} ; 

a procedure that determines a set of n attributes for each pattern in said 
subset of patterns {pi}, {xj, i = 1, 2, ... k, Xi e R"}; 

a procedure that forms a training set T= {(Xi, yi), 1 = 1,2, ... k}; 

a procedure that uses a supervised machine learning technique to 
determine a mapping function based on the training set T; and 

a procedure that uses said mapping function determined in step (f) to 
map additional patterns. 



